On Optimal Pairwise Linear Classi ers for Normal Distributions: The Two-Dimensional Case
نویسندگان
چکیده
Computing linear classi ers is a very important problem in statistical Pattern Recognition (PR). These classi ers have been investigated by the PR community extensively since they are the ones which are both easy to implement and comprehend. It is well known that when dealing with normally distributed classes, the optimal discriminant function for two-classes is linear only when the covariance matrices are equal. Other approaches, such as the Fisher's discriminant, the perceptron algorithm, minimum square distance classi ers, etc., have solved this problem by generating a linear classi er in normal and non-normal distributions, but these classi ers are typically suboptimal. In this paper we shall focus on some special cases of the normal distribution with non-equal covariance matrices. We present a complete analysis of the case when the classi er is pairwise linear, and to our knowledge this is a pioneering work for the use of such classi ers in any are of statistical PR. We shall determine the conditions that the mean vectors and covariance matrices have to satisfy in order to obtain the optimal linear classi er. However, as opposed to the state of the art, in all the cases discussed here, the linear classi er is given by a pair of straight lines, which is a particular case of the general equation of second degree. One of these cases is when we have two overlapping classes with equal means, which is a general case of the Minsky's Paradox for the Perceptron. We present a general linear classi er for this particular case which can be obtained directly from the parameters of the distribution. Numerous other analytic results for two dimensional normal random vectors have been derived. Finally, we have also provided some empirical results in all the cases, and demonstrated that these linear classi ers achieve very good performance.
منابع مشابه
Classifier performance as a function of distributional complexity
When choosing a classi cation rule, it is important to take into account the amount of sample data available. This paper examines the performances of classi ers of di5ering complexities in relation to the complexity of feature-label distributions in the case of small samples. We de ne the distributional complexity of a feature-label distribution to be the minimal number of hyperplanes necessary...
متن کاملA one-dimensional analysis for the probability of error of linear classifiers for normally distributed classes
Computing the probability of error is an important problem in evaluating classi ers. When dealing with normally distributed classes, this problem becomes intricate due to the fact that there is no closedform expression for integrating the probability density function. In this paper, we derive lower and upper bounds for the probability of error for a linear classi er, where the random vectors re...
متن کاملCoulomb Classi ers: Generalizing Support Vector Machines via an Analogy to Electrostatic Systems
We introduce a family of classi ers based on a physical analogy to an electrostatic system of charged conductors. The family, called Coulomb classi ers, includes the two best-known support-vector machines (SVMs), the {SVM and the C{SVM. In the electrostatics analogy, a training example corresponds to a charged conductor at a given location in space, the classi cation function corresponds to the...
متن کاملMulti-class Classification in Big Data
The paper suggests the on-line multi-class classi er with a sublinear computational complexity relative to the number of training objects. The proposed approach is based on the combining of two-class probabilistic classi ers. Pairwise coupling is a popular multi-class classication method that combines all comparisons for each pair of classes. Unfortunately pairwise coupling su ers in many cases...
متن کاملThe performance of regularized discriminant analysis versus non-parametric classifiers applied to high-dimensional image classification
Classi® cation of very-high-dimensional images is of the utmost interest in remote sensing applications. Storage space, and mainly the computational e ort required for classifying these kinds of images, are the main drawbacks in practice. Moreover, it is well known that a number of spectral classi® ers may not be useful (even not valid) in practice for classifying very-high-dimensional images....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002